Learning and Reusing Goal-Specific Policies for Goal-Driven Autonomy
Authors
Abstract
In certain adversarial environments, reinforcement learning (RL) techniques require a prohibitively large number of episodes to learn a high-performing strategy for action selection. For example, Q-learning is particularly slow to learn a policy to win complex strategy games. We propose GRL, the first GDA system capable of learning and reusing goal-specific policies. GRL is a case-based goal-driven autonomy (GDA) agent embedded in the RL cycle. GRL acquires and reuses cases that capture episodic knowledge about an agent’s (1) expectations, (2) goals to pursue when these expectations are not met, and (3) actions for achieving these goals in given states. Our hypothesis is that, unlike RL, GRL can rapidly fine-tune strategies by exploiting the episodic knowledge captured in its cases. We report performance gains versus a state-of-the-art GDA agent and an RL agent for challenging tasks in two real-time video game domains.
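The three kinds of episodic knowledge listed in the abstract can be illustrated with a toy agent. This is only a minimal sketch and not the authors' GRL implementation. The class name `GoalPolicyAgent`, its methods, the tabular Q-learning update, and the exact-match discrepancy test are all assumptions made for illustration; the abstract does not specify how cases are stored or retrieved.

```python
import random
from collections import defaultdict


class GoalPolicyAgent:
    """Hypothetical toy agent: one Q-table per goal, plus two simple case stores."""

    def __init__(self, goals, actions, alpha=0.5, gamma=0.9, epsilon=0.1, seed=0):
        self.goals = list(goals)
        self.actions = list(actions)
        self.alpha, self.gamma, self.epsilon = alpha, gamma, epsilon
        self.rng = random.Random(seed)
        # (3) Goal-specific policies: q[goal][(state, action)] -> value.
        self.q = {g: defaultdict(float) for g in self.goals}
        # (1) Expectations: (state, action) -> most recently observed next state.
        self.expected = {}
        # (2) Goal cases: (goal, discrepancy) -> {next_goal: running value}.
        self.goal_cases = defaultdict(lambda: defaultdict(float))

    def act(self, goal, state):
        """Epsilon-greedy action choice under the policy of the current goal."""
        if self.rng.random() < self.epsilon:
            return self.rng.choice(self.actions)
        return max(self.actions, key=lambda a: self.q[goal][(state, a)])

    def discrepancy(self, state, action, next_state):
        """Return (expected, observed) if the stored expectation was violated, else None."""
        exp = self.expected.get((state, action))
        self.expected[(state, action)] = next_state
        if exp is not None and exp != next_state:
            return (exp, next_state)
        return None

    def next_goal(self, goal, disc):
        """Pick the goal with the best recorded value for this discrepancy."""
        cases = self.goal_cases[(goal, disc)]
        if not cases or self.rng.random() < self.epsilon:
            return self.rng.choice(self.goals)
        return max(cases, key=cases.get)

    def learn(self, goal, state, action, reward, next_state):
        """Standard tabular Q-learning update, applied only to this goal's table."""
        best_next = max(self.q[goal][(next_state, a)] for a in self.actions)
        key = (state, action)
        self.q[goal][key] += self.alpha * (reward + self.gamma * best_next - self.q[goal][key])

    def credit_goal(self, goal, disc, chosen, reward):
        """Move the value of choosing `chosen` after `disc` toward the observed reward."""
        cases = self.goal_cases[(goal, disc)]
        cases[chosen] += self.alpha * (reward - cases[chosen])
```

Keeping a separate table per goal means that experience gathered while pursuing one goal never overwrites the policy for another, which is one plausible reading of "reusing goal-specific policies". A real agent would need a similarity-based case retrieval step rather than the exact dictionary lookup assumed here.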
Similar resources
Self-Regulation, Goal Orientation, Tolerance of Ambiguity and Autonomy as Predictors of Iranian EFL learners’ Second Language Achievement: A Structural Equation Modeling Approach
The identification of the cognitive, affective, social and even physiological factors affecting second or foreign language learning routes and rate has for long been a challenging aspiration for second language researchers. However, a recent preoccupation of the researchers in this area has been the study of the combinatorial impacts of such factors on second or foreign language learning proces...
Case-Based Learning in Goal-Driven Autonomy Agents for Real-Time Strategy Combat Tasks
We describe a study on using case-based learning techniques in a goal-driven autonomy (GDA) agent for real-time strategy games. The two case bases in our Learning GDA (LGDA) agent store (1) the expected states that an agent can reach when executing an action in and (2) the next goals to pursue when a discrepancy occurs between the expected and encountered states. We report on an ablation study ...
Applying Appraisal Theories to Goal Directed Autonomy
Appraisal theories (Roseman and Smith, 2001) are psychological theories exploring how humans evaluate situations along a number of appraisal dimensions, many of which compare the situation to the current goals. We report on the application of appraisal theories to focus the learning of policies in complex domains via Soar 9’s built-in reinforcement learning mechanism. In addition, we describe h...
Goal-Driven Autonomy in Planning and Acting
To operate autonomously in complex environments, agents must perform actions, sense the environment, and respond to new situations. Traditional approaches face difficulties with incomplete environment models, goal specification, and engineering domain specific control knowledge. We believe goal reasoning will address these challenges enabling agents to better respond to unexpected circumstances...
Integrated Learning for Goal-Driven Autonomy
Goal-driven autonomy (GDA) is a reflective model of goal reasoning that controls the focus of an agent’s planning activities by dynamically resolving unexpected discrepancies in the world state, which frequently arise when solving tasks in complex environments. GDA agents have performed well on such tasks by integrating methods for discrepancy recognition, explanation, goal formulation, and goa...
Publication date: 2012